practical application
Planning with General Objective Functions: Going Beyond Total Rewards
Standard sequential decision-making paradigms aim to maximize the cumulative reward when interacting with the unknown environment., i.e., maximize $\sum_{h = 1}^H r_h$ where $H$ is the planning horizon. However, this paradigm fails to model important practical applications, e.g., safe control that aims to maximize the lowest reward, i.e., maximize $\min_{h= 1}^H r_h$. In this paper, based on techniques in sketching algorithms, we propose a novel planning algorithm in deterministic systems which deals with a large class of objective functions of the form $f(r_1, r_2, ... r_H)$ that are of interest to practical applications. We show that efficient planning is possible if $f$ is symmetric under permutation of coordinates and satisfies certain technical conditions. Complementing our algorithm, we further prove that removing any of the conditions will make the problem intractable in the worst case and thus demonstrate the necessity of our conditions.
Conditional Diffusion Process for Inverse Halftoning
Inverse halftoning is a technique used to recover realistic images from ancient prints (\textit{e.g.}, photographs, newspapers, books). The rise of deep learning has led to the gradual incorporation of neural network designs into inverse halftoning methods. Most of existing inverse halftoning approaches adopt the U-net architecture, which uses an encoder to encode halftone prints, followed by a decoder for image reconstruction. However, the mainstream supervised learning paradigm with element-wise regression commonly adopted in U-net based methods has poor generalization ability in practical applications. Specifically, when there is a large gap between the dithering patterns of the training and test halftones, the reconstructed continuous-tone images have obvious artifacts.
most DSNs, covering various practical applications, such as camera networks for sports game videos capturing and
We thank the reviewers for all of these valuable comments. We provide point-by-point responses below. Re: generalize to other applications. Cooperative Navigation problem (Lowe et al. '17) and achieved a competitive mean reward (-4.8) against MADDPG Specifically, the stochastic target selection will make the executor inefficient to learn. We will further discuss the factors of each component in the next revision.
The study of the generalization of 2
We thank the reviewers for their constructive and positive comments. They will improve the quality of the paper. As an instance in RL, we mention the problem of "active exploration in MDPs" (see [28]), where the Reiterating the discussion in Section 2.3, let us consider the small-budget regime, and We will provide a footnote in page 7 to clarify this. This is indeed a nice remark. As a result, the theorem is valid even if irreducibility and aperiodicity are dropped.
A common concern from all reviewers is
We kindly thank the reviewers for their detailed reviews, valuable feedback and suggestions for improvement. Indeed, our proof of the new SW theorem relies on an "ordering" of the coordinates of arbitrary equivariant SW theorem under arbitrary finite group action would be desirable, however the proof is out of our reach as of today. In a way, this limitation is similar to the distinction between "point clouds" (which in We will add this discussion in the paper, and mention it in the abstract. In its "deep" original version, it covers all type of "Message-Passing" GNNs, but not spectral GNNs which use powers of the adjacency matrix. We will clarify this in the final version.
Adjustable AprilTags For Identity Secured Tasks
--Special tags such as AprilT ags that facilitate image processing and pattern recognition are useful in practical applications. In close and private environments, identity security is unlikely to be an issue because all involved AprilT ags can be completely regulated. However, in open and public environments, identity security is no longer an issue that can be neglected. T o handle potential harm caused by adversarial attacks, this note advocates utilization of adjustable AprilT ags instead of fixed ones. Special tags that facilitate image processing and pattern recognition are useful in practical applications.
- North America > United States > Michigan (0.05)
- Asia > China > Shanghai > Shanghai (0.05)
Differentiating hype from practical applications of large language models in medicine -- a primer for healthcare professionals
MSC 804 5 - 0020 - 10 St. Louis, MO 63110 Financial Support: P30 - AR073752 Conflict of interest: No conflicts declared. Page 2 of 13 Roberson 2025 - LLMs in medicine Abstract The medical ecosystem consists of the training of new clinicians and researchers, the practice of clinical medicine, and areas of adjacen t research. There are many aspects of these domains that could benefit from the application of task automation and programmatic assistance. Machine learning and artificial intelligence techniques, including large language models ( LLMs), have been promised to deliver on healthcare innovation, improving care speed and accuracy, and reducing the burden on staff for manual interventions . However, LLMs have no understanding of objective truth that is based in reality. They also represent real risks to the disclo sure of protected information when used by clinicians and researchers. The use of AI in medicine in general, and the deployment of LLMs in particular, therefore requires careful consideration and thoughtful application to reap the benefits of these technol ogies while avoiding the dangers in each context. Page 3 of 13 Roberson 2025 - LLMs in medicine Abbreviations AI: Artificial intelligence CSP: Constrained solution problems EHR: Electronic health records GPT: Generative pre - trained transformer LLM: Large language model ML: Machine learning RAG: Retrieval - augmented generation Page 4 of 13 Roberson 2025 - LLMs in medicine What is a large language model? Large language models are one of the most hyped artificial intelligence technologies of the past few years. Buzz terms associated with them include artificial intelligence, machine learning, a nd deep learning.
- North America > United States > Missouri > St. Louis County > St. Louis (0.24)
- North America > United States > New York (0.04)
Planning with General Objective Functions: Going Beyond Total Rewards
Standard sequential decision-making paradigms aim to maximize the cumulative reward when interacting with the unknown environment., i.e., maximize \sum_{h 1} H r_h where H is the planning horizon. However, this paradigm fails to model important practical applications, e.g., safe control that aims to maximize the lowest reward, i.e., maximize \min_{h 1} H r_h . In this paper, based on techniques in sketching algorithms, we propose a novel planning algorithm in deterministic systems which deals with a large class of objective functions of the form f(r_1, r_2, ... r_H) that are of interest to practical applications. We show that efficient planning is possible if f is symmetric under permutation of coordinates and satisfies certain technical conditions. Complementing our algorithm, we further prove that removing any of the conditions will make the problem intractable in the worst case and thus demonstrate the necessity of our conditions.
SUMART: SUMmARizing Translation from Wordy to Concise Expression
We propose SUMART, a method for summarizing and compressing the volume of verbose subtitle translations. SUMART is designed for understanding translated captions (e.g., interlingual conversations via subtitle translation or when watching movies in foreign language audio and translated captions). SUMART is intended for users who want a big-picture and fast understanding of the conversation, audio, video content, and speech in a foreign language. During the training data collection, when a speaker makes a verbose statement, SUMART employs a large language model on-site to compress the volume of subtitles. This compressed data is then stored in a database for fine-tuning purposes. Later, SUMART uses data pairs from those non-compressed ASR results and compressed translated results for fine-tuning the translation model to generate more concise translations for practical uses. In practical applications, SUMART utilizes this trained model to produce concise translation results. Furthermore, as a practical application, we developed an application that allows conversations using subtitle translation in augmented reality spaces. As a pilot study, we conducted qualitative surveys using a SUMART prototype and a survey on the summarization model for SUMART. We envision the most effective use case of this system is where users need to consume a lot of information quickly (e.g., Speech, lectures, podcasts, Q&A in conferences).
Challenges and Trends in Egocentric Vision: A Survey
Li, Xiang, Qiu, Heqian, Wang, Lanxiao, Zhang, Hanwen, Qi, Chenghao, Han, Linfeng, Xiong, Huiyu, Li, Hongliang
With the rapid development of artificial intelligence technologies and wearable devices, egocentric vision understanding has emerged as a new and challenging research direction, gradually attracting widespread attention from both academia and industry. Egocentric vision captures visual and multimodal data through cameras or sensors worn on the human body, offering a unique perspective that simulates human visual experiences. This paper provides a comprehensive survey of the research on egocentric vision understanding, systematically analyzing the components of egocentric scenes and categorizing the tasks into four main areas: subject understanding, object understanding, environment understanding, and hybrid understanding. We explore in detail the sub-tasks within each category. We also summarize the main challenges and trends currently existing in the field. Furthermore, this paper presents an overview of high-quality egocentric vision datasets, offering valuable resources for future research. By summarizing the latest advancements, we anticipate the broad applications of egocentric vision technologies in fields such as augmented reality, virtual reality, and embodied intelligence, and propose future research directions based on the latest developments in the field.
- Europe > Switzerland > Zürich > Zürich (0.14)
- Europe > Netherlands > North Holland > Amsterdam (0.04)
- Asia > China > Sichuan Province > Chengdu (0.04)
- (13 more...)
- Overview (1.00)
- Research Report > Promising Solution (0.45)
- Instructional Material > Course Syllabus & Notes (0.45)
- Leisure & Entertainment (1.00)
- Information Technology (1.00)
- Health & Medicine > Therapeutic Area (0.67)
- (3 more...)